Suitability of Signature Indexing Over the World Wide Web
نویسندگان
چکیده
Signature indexing has been studied extensively in text database or other databases for many years. The main advantages of a signature le as an access index are its small size, distributability, the ability to index information of a wide variety of types, ease of maintenance, and the ability to provide fuzzy indexing. These features are precisely what are needed for a good access index for indexing the World Wide Web and web-related large data depositories such as web caches and search engines. However, there has not been a study to comprehensively compare the features of signature indexing with the needs of web indexing. This paper compares the features of signature indexing with the needs of access indexes over the Web. The comparison provides a convincing argument that signature indexing is very suited for indexing information over the Web.
منابع مشابه
Melody based tune retrieval over the World Wide Web
In this paper we describe the steps taken to develop a Web-based version of an existing stand-alone, single-user digital library application for melodical searching of a collection of music. For the three key components: input, searching, and output, we assess the suitability of various Web-based strategies that deal with the now distributed software architecture and explain the decisions we ma...
متن کاملA Unified Approach to Indexing Multimedia on the Web
Indexing multimedia Web documents can be regarded as an important part of Web engineering, a concept first proposed [19] by one of the authors and his collaborators in 1998 at the World Wide Web WWW7 conference in Brisbane, Australia. Contentbased indexing of multimedia has always been a challenging task. The enormity and diversity of the multimedia content on the World Wide Web (WWW) adds anot...
متن کاملDynamic and Distributed Indexing Architecture in Search Engine using Grid Computing
Search engines require computers with high computation resources for processing to crawl web pages and huge data storage to store billions of pages collected from the World Wide Web after parsing and indexing these pages. The indexer is one of the main components of the search engine that come intermediate between the crawler and the searcher. Indexing is the process of organizing the collected...
متن کاملDistributed Digital Library Architecture Incorporating Different Index Styles
The New Zealand Digital Library offers several collections of information over the World Wide Web. Although fulltext indexing is the primary access mechanism, musical collections can also be accessed through a novel melody retrieval system. In offering this service over a three-year period, we have had to face many practical challenges in building, maintaining, and administering diverse collect...
متن کاملTrends for Web Information Processing over World Wide Web
Web information processing through modern search engines index zillions of web pages on distributed platforms of thousands of commodity web users. Much of the research has been done on the information processing aspects ranging from crawling, web graph topology, indexing, efficient query processing, caching and ranking. Despite all of the challenges, the expansion of the web has turned informat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999